RAW SEQUENCE LISTING 



The Biotechnology Systems Branch of the Scientific and Technical 
Information Center (STIC) no errors detected. 

Application Serial Number: /q/%20 } ^57 

Source: 1 IFIaJQ 

Date Processed by STIC: / j^D/Q^ 



ENTERED 



Page 1 of 7 




IFWO 



RAW SEQUENCE LISTING DATE : 01/27/2005 

PATENT APPLICATION: US/10/820,451 TIME: 13:00:24 

Input Set : A:\Mtv07601.app 

Output Set: N:\CRF4\01272005\J820451.raw 

3 <110> APPLICANT: LESSARD, PHILLIP A. 

4 SINSKEY, ANTHONY J. 

6 <120> TITLE OF INVENTION: NOVEL COMPOSITIONS AND METHODS FOR GENETIC MANIPULATION 

7 OF RHODOCOCCUS BACTERIA 

9 <130> FILE REFERENCE: MTV- 076. 01 

11 <140> CURRENT APPLICATION NUMBER: 10/820,451 

12 <141> CURRENT FILING DATE: 2004-04-07 
14 <160> NUMBER OF SEQ ID NOS : 21 

16 <170> SOFTWARE: Patentln Ver. 3.2 

18 <210> SEQ ID NO: 1 

19 <211> LENGTH: 4970 

20 <212> TYPE: DNA 

21 <213> ORGANISM: Rhodococcus sp . 

23 <400> SEQUENCE: 1 

24 gaggaacgtc gggcggtcga gcgcggcgtc gtcgcgtgtg gagttgcgcg agcgggtggt 60 

25 cgaggtcgag aaggtggtgc atcggccggc cacgacggcc gcgtccatcg accgcgtact 12 0 

26 cgacgacgat caggcgctgc agaagctgtt gaacacgctg cggcaccgca tcaaggaagg 180 

27 cacctaccgg cccggcggtg acaggaacat gacgtgggtg ctccggaggc tcacggcccc 240 

28 ggtggccgat ctgctcgccc agatccactc gagtgcaggc accaccgcgt acatcgaccc 300 

29 ggcgcaggcc gtgcaggcag tgctcggctc accacgctcg tgccgcgcgg tcctggaagg 360 
3 0 cctcgacaca atggcccgct ccggagagct cgacggcggc cagcacaccg ccacactgac 420 

31 agccctcgac cggctcgtag ctcacacccg gcagcagagc cgatagaaaa cggagctgcc 480 

32 gcggcaccgc tgcgcgctgc cttgccaccg aaacgatgtc ccttacgacg gggaccgagg 540 

33 gagtgaccgc atgagtcttg atcagcagat agccgagctg cgcgagcagt tgaacgccct 600 

34 cggccgccag gtgcgcgagg tgacagaggc cgccgacgag caggacaagc ccgacgtcgt 660 

35 cgacgccctg gccgacgtgc gcatgcacct caacgacgcc gccgccgtcc tcggggacgt 72 0 

36 gatcgcatga gcgactggac gagctgggtc tgtcccaggt caccgaccca gcaacacgtc 780 

37 cccgctggcc taccgccacg tcacgcccag gtgagggaca ctgacccacg tgaccaaccc 840 

38 agggcagacg ttcaccgtga ccacggccgc caaggcgtgc caggtgtcgc ggaagacgat 900 
3 9 cactcgacgc cttgaccagc tgcgagacgg cggcgcgtac aaggatgcgg ccggcgcgtg 960 

40 ggtgatcccg cttggttcgc tcctccacgc cggtctcagt ccgggtaggc cagcgtcccc 102 0 

41 ggatgctgtc cacgtgtccc aggaggacac acagggacag gaggacagag gctggttcac 1080 

42 cctcgagcaa gtcgcggaat tgcgtaagcg cgccgacgag gcggagtcac ttcgccggcg 1140 

43 ggccgaggtc gcggaagccc tggccacaga gcgggaaaga gtgatcgagg tgcaggcgcg 1200 

44 cgccctgcgg atgctcgagg ccggccaacg gccgaccgcg caccaggaaa cgacgtcaga 1260 

45 ggtttccgcc acgccggccg atgtgccgcc acgccgcggt ccgctagaac gcctggtcgg 1320 

46 acggttcggg ctgtagcgtt tggcaaggtc aagccagcgc cgaccggggc aattggtaga 1380 

47 gcaagtgact cttaatcact gggtcatatg gcgagaccca tggcgattat ctgggaggaa 1440 

48 gacgcggtcg gggacttctg tttcagcgcc gtcgaagtgc ctctgggatg ggttgtttgt 1500 

49 ggaccgtagt cagctggtcg aaggctgcgc cacagacggg gcaaaagcct ttgatgcgcg 1560 

50 cgtggccgcc gacctcgaca tccgttggcc caccgagcca ttgctcgttt gaacaggtct 1620 

51 cacagaccat cagggtcgcg cgtgcaacgc acccgcacgg cagggctgtc ccaaggggga 1680 

52 cgggctggta gtggacgtca tcgcagctga tactcatcgc tcgattcctg ccagggcgtg 1740 
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53 atggacagtg ccgaccgagc atcccagctc attggcgata gctctcatgg acatgccagt 1800 

54 accgcgtagc tcccgtatct tctcgcgccg ttgctgtgcc cgggacaggt aggcctctcg 1860 

55 cggctctgat gtccaccgga tgatgctggc cgtcgatacg cctgtgcgtt cggcgagctc 192 0 

56 ccgcactgtt atcccgttgc gaggcagacg ctcagtcatg tgtggcctcc tcaattgcca 1980 

57 gtcgcacctg atccagtttg gtgctgcgtg cacgccgggc ctggccgctg gcgacgccgc 2040 

58 ctttgcggcc acgggcggat tggatggtcg agagtgtggc ttcgtagacg gcgataccgt 2100 

59 ctttccagat gcgggatcgc gtggtgatcc atcggtgaat gctggcggcg atggcgcggg 2160 

60 cttcagaggg tgggagttcc tcggtgtagc tggcgttgag ggcggtgact tcgcggtgga 2220 

61 gggcgagttc gaggccggcg ctgtcatgag tgggcaggta gttgcgcatg aggccggggc 2280 

62 gggtggccca ggtgcgagcg gcgtggaaga ttgcgcagtt gcggccgagt cctatggggg 234 0 

63 cgtcacggtg ggccttggtg tgctgccagc gcggggaggg catgtgtgtg ccgagttcgg 2400 

64 cctcgagctc ggcgaggctg cgcggttcgg tgtggatcca gtgcgtgtcc cagcctgagt 2460 

65 gtgtggggtt cttggtcatc aggcctgagt agccgatgtc cccttcgacg gcccggcgta 2520 

66 gcccttcggt aacggcggcg gcgtacgcga ggggcttccg tcgggcgtac tcggtgcggg 2580 

67 tgaacggctc gcgcagccac cagtgcaggt gcgcatgccc gttgctgctg ttttcgatca 2640 

68 cggcgttcgg catagggtga ttgcctgccg ctgacagtgc gcgcagggcg gagtcggggt 2700 

69 ggtcgacgtc gacgacgagc agattgctca gtgtctgcgg gttggcctcg atgtagcgcc 2760 

70 gctcaagtgc ggcagggcgg cgcattcgat agaccccctc aaggaagtcg tttgttgcca 2820 

71 aaggccagaa cggtagccac atttgctccc agtccccacc ttcccgcacc ccgtgtgtgt 2880 

72 ccatgctggt gaccgtcgca tgctctcgaa cgaaacctgg gcatttccct cggtgtgttc 2940 

73 aaagttaatc gtgaagccaa gtcagatgtt tcggtgtcgc caggcgcccc tcggtgtgtt 3000 

74 caaagttaat cgtgaagcca agtcagatgt ttcggtacgc ccctggggaa accccttcaa 3060 

75 gtccggcact cccaatgcct catcgtgcgc atgcaggttt gaaacggccg gcggcttaat 3120 

76 ttcgtcgcgc gaaaacctct ccacggtctc tgaagttcag caagaggtgt gcctgaattt 3180 

77 cagtgcgccc ggactcgcaa cgaagtgccc gcttgctgat ttgtcggcga ctcagtcggt 3240 

78 gccatgtacg gacgcgtgac gtttcaccac ggtgcgggcg agatcagcga tcttgacgtt 3300 

79 catgttctgt gaggttcgcc tgagaatgtc gaaggcgtcc tcagacgaca cgtcgtgcag 3360 

80 tgccatcagc agaccgatcg ccttaccgat ctcgcgattg ctctccagcc cgcgccgcag 342 0 

81 ggtgactgcc tcttctcccc aggtgatggc agccaccgtc acggctgcga atgacgcgag 3480 

82 cacggcagcg ggtcctgcgg cctcagtcgt gaacgccccc ggagtgtcac tgaagagatt 3540 

83 cagtgctcca actttgtggc cgtcgacgag cagacgaaat cccatgacgc cacgaaccgg 3600 

84 agtttgtgcg agcacctggc gcgccagccg cggccagccg gtgggtgtgg cgaggtcggg 3660 

85 atcgagttgt gcggcttcct cctctatcgc gtcgaggcac ggtccctccc cggtctggcg 372 0 

86 ctcgagatcg tctaccagcc gcgcgatgtc gtccgatgcg gccaccgtca cgggtaaatc 3780 

87 ccggccacgc aacatcaggc ttgcgtggtc acagccggga accagcaggg tggcggcgac 3 840 

88 acagaccgcc gcgtaaacct cggacggctc ggaaccgttg tagacaatgt cggcgagcgc 3900 

89 ggcgaatacg gtccccggat ccgacttgct gggggtagct gcgctctgcg cgtccgtcat 3960 

90 gacgttcctg ctttcaagcc cctctgggct gcgtcaaacg ctacgcggtg gaactgctcg 4020 

91 ggagtcatcc cagatggtgg agtgaaaggc gtgcgtttgg catcggccga ggcccgataa 4080 

92 ccgtcgtccg cgcggccgca gagcgctgag caagataggc acttacctca cgtaagtgag 4140 

93 gagtgatggg tgcgcgagcg atcgcgcacc ggagacaccg ggtgaaggct gggtccggta 4200 

94 gcgcagggac agcacgcggc gcagcctgac tcgctacctt gctgactcgc taccttgctg 4260 

95 actcgctacc ttgctgactc gctaccttgc tgactcgcta cctacgcggc gcagcaagat 432 0 

96 ggagacttag gtgccctaag tcgcagcccg cattcgcggg atccctgtcg gggcgctgcg 4380 

97 ctgggcggtg aaataccaac cttgcggttc ctgggaacag ctaaaccgaa cggtgtcgct 4440 

98 tcggcagtcg gggatgacga cgccaggtgc gctgggggct acaaaggaac gagatgctcg 4500 

99 ctggtgttac cgaggtggtg tgccaacgag aatccaactt gccgcagcgt ttggtgtttt 4560 

100 tgggggggaa gcggatctcc cgatgagggg cactgccacg cacactgagg ggcgatccgc 4620 

101 atgggtgttt cacgggcacg gtcacttgcg tactgggagc agcacgtctg gatcgaaacg 4680 
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102 cacgaccaca tatccccggg ccggtaccaa gtgccggccg agaaacgtgc atctcccgcg 4740 

103 gacaccgtgt cggcacctga gaaccaagcc gagtaattcc acgacgcaac agatccagat 4800 

104 ccgacacacc tcttgcactc aggagtgaat gtcggggcat tttcgcgcga cgaaattagg 4860 

105 atcgcggcta tgacgacgcc gcgatgcccc cgctgtgacc gcgagatcga gagcaacacc 4920 

106 agcgcactgc acggtcgcgg ccggccgcgg aagtggtgct cggacgagtg 4970 

109 <210> SEQ ID NO: 2 

110 <211> LENGTH: 1341 

111 <212> TYPE: DNA 

112 <213> ORGANISM: Rhodococcus sp. 

114 <400> SEQUENCE: 2 

115 cgtcatcgca gctgatactc atcgctcgat tcctgccagg gcgtgatgga cagtgccgac 60 

116 cgagcatccc agctcattgg cgatagctct catggacatg ccagtaccgc gtagctcccg 12 0 

117 tatcttctcg cgccgttgct gtgcccggga caggtaggcc tctcgcggct ctgatgtcca 180 

118 ccggatgatg ctggccgtcg atacgcctgt gcgttcggcg agctcccgca ctgttatccc 240 

119 gttgcgaggc agacgctcag tcatgtgtgg cctcctcaat tgccagtcgc acctgatcca 300 
12 0 gtttggtgct gcgtgcacgc cgggcctggc cgctggcgac gccgcctttg cggccacggg 360 

121 cggattggat ggtcgagagt gtggcttcgt agacggcgat accgtctttc cagatgcggg 42 0 

122 atcgcgtggt gatccatcgg tgaatgctgg cggcgatggc gcgggcttca gagggtggga 480 

123 gttcctcggt gtagctggcg ttgagggcgg tgacttcgcg gtggagggcg agttcgaggc 540 

124 cggcgctgtc atgagtgggc aggtagttgc gcatgaggcc ggggcgggtg gcccaggtgc 600 

125 gagcggcgtg gaagattgcg cagttgcggc cgagtcctat gggggcgtca cggtgggcct 660 

126 tggtgtgctg ccagcgcggg gagggcatgt gtgtgccgag ttcggcctcg agctcggcga 72 0 

127 ggctgcgcgg ttcggtgtgg atccagtgcg tgtcccagcc tgagtgtgtg gggttcttgg 780 
12 8 tcatcaggcc tgagtagccg atgtcccctt cgacggcccg gcgtagccct tcggtaacgg 840 
12 9 cggcggcgta cgcgaggggc ttccgtcggg cgtactcggt gcgggtgaac ggctcgcgca 900 

130 gccaccagtg caggtgcgca tgcccgttgc tgctgttttc gatcacggcg ttcggcatag 960 

131 ggtgattgcc tgccgctgac agtgcgcgca gggcggagtc ggggtggtcg acgtcgacga 102 0 

132 cgagcagatt gctcagtgtc tgcgggttgg cctcgatgta gcgccgctca agtgcggcag 1080 

133 ggcggcgcat tcgatagacc ccctcaagga agtcgtttgt tgccaaaggc cagaacggta 1140 

134 gccacatttg ctcccagtcc ccaccttccc gcaccccgtg tgtgtccatg ctggtgaccg 1200 

135 tcgcatgctc tcgaacgaaa cctgggcatt tccctcggtg tgttcaaagt taatcgtgaa 1260 

136 gccaagtcag atgtttcggt gtcgccaggc gcccctcggt gtgttcaaag ttaatcgtga 132 0 

137 agccaagtca gatgtttcgg t 1341 

140 <210> SEQ ID NO: 3 

141 <211> LENGTH: 744 

142 <212> TYPE: DNA 

143 <213> ORGANISM: Rhodococcus sp. 

145 <400> SEQUENCE: 3 

146 ctcgagatcg tctaccagcc gcgcgatgtc gtccgatgcg gccaccgtca cgggtaaatc 60 

147 ccggccacgc aacatcaggc ttgcgtggtc acagccggga accagcaggg tggcggcgac 12 0 

148 acagaccgcc gcgtaaacct cggacggctc ggaaccgttg tagacaatgt cggcgagcgc 180 

149 ggcgaatacg gtccccggat ccgacttgct gggggtagct gcgctctgcg cgtccgtcat 24 0 

150 gacgttcctg ctttcaagcc cctctgggct gcgtcaaacg ctacgcggtg gaactgctcg 300 

151 ggagtcatcc cagatggtgg agtgaaaggc gtgcgtttgg catcggccga ggcccgataa 360 

152 ccgtcgtccg cgcggccgca gagcgctgag caagataggc acttacctca cgtaagtgag 42 0 

153 gagtgatggg tgcgcgagcg atcgcgcacc ggagacaccg ggtgaaggct gggtccggta 480 

154 gcgcagggac agcacgcggc gcagcctgac tcgctacctt gctgactcgc taccttgctg 540 

155 actcgctacc ttgctgactc gctaccttgc tgactcgcta cctacgcggc gcagcaagat 600 

156 ggagacttag gtgccctaag tcgcagcccg cattcgcggg atccctgtcg gggcgctgcg 660 



file://C:\CRF4\Outhold\VsrJ82045 1 .htm 



RAW SEQUENCE LISTING DATE: 01/27/2005 

PATENT APPLICATION: US/10/820 , 451 TIME: 13:00:24 



Input Set : A:\Mtv07601.app 

Output Set: N:\CRF4\01272005\J820451.raw 

157 ctgggcggtg aaataccaac cttgcggttc ctgggaacag ctaaaccgaa cggtgtcgct 720 

158 tcggcagtcg gggatgacga cgcc 744 

161 <210> SEQ ID NO: 4 

162 <211> LENGTH: 91 

163 <212> TYPE: DNA 

164 <213> ORGANISM: Rhodococcus sp. 

166 <400> SEQUENCE: 4 

167 ggcgcctggc gacaccgaaa catctgactt ggcttcacga ttaactttga acacaccgag 60 

168 ggaaatgccc aggtttcgtt cgagagcatg c 91 

171 <210> SEQ ID NO: 5 

172 <211> LENGTH: 933 

173 <212> TYPE: DNA 

174 <213> ORGANISM: Rhodococcus sp. 

176 <400> SEQUENCE: 5 

177 atggacacac acggggtgcg ggaaggtggg gactgggagc aaatgtggct accgttctgg 60 

178 cctttggcaa caaacgactt ccttgagggg gtctatcgaa tgcgccgccc tgccgcactt 12 0 

179 gagcggcgct acatcgaggc caacccgcag acactgagca atctgctcgt cgtcgacgtc 180 

180 gaccaccccg actccgccct gcgcgcactg tcagcggcag gcaatcaccc tatgccgaac 240 

181 gccgtgatcg aaaacagcag caacgggcat gcgcacctgc actggtggct gcgcgagccg 300 

182 ttcacccgca ccgagtacgc ccgacggaag cccctcgcgt acgccgccgc cgttaccgaa 360 

183 gggctacgcc gggccgtcga aggggacatc ggctactcag gcctgatgac caagaacccc 42 0 

184 acacactcag gctgggacac gcactggatc cacaccgaac cgcgcagcct cgccgagctc 480 

185 gaggccgaac tcggcacaca catgccctcc ccgcgctggc agcacaccaa ggcccaccgt 540 

186 gacgccccca taggactcgg ccgcaactgc gcaatcttcc acgccgctcg cacctgggcc 600 

187 acccgccccg gcctcatgcg caactacctg cccactcatg acagcgccgg cctcgaactc 660 

188 gccctccacc gcgaagtcac cgccctcaac gccagctaca ccgaggaact cccaccctct 720 

189 gaagcccgcg ccatcgccgc cagcattcac cgatggatca ccacgcgatc ccgcatctgg 780 

190 aaagacggta tcgccgtcta cgaagccaca ctctcgacca tccaatccgc ccgtggccgc 840 

191 aaaggcggcg tcgccagcgg ccaggcccgg cgtgcacgca gcaccaaact ggatcaggtg 900 

192 cgactggcaa ttgaggaggc cacacatgac tga 93 3 

195 <210> SEQ ID NO: 6 

196 <211> LENGTH: 246 

197 <212> TYPE: DNA 

198 <213> ORGANISM: Rhodococcus sp. 

200 <400> SEQUENCE: 6 

201 atgactgagc gtctgcctcg caacgggata acagtgcggg agctcgccga acgcacaggc 60 

202 gtatcgacgg ccagcatcat ccggtggaca tcagagccgc gagaggccta cctgtcccgg 120 

203 gcacagcaac ggcgcgagaa gatacgggag ctacgcggta ctggcatgtc catgagagct 180 
2 04 atcgccaatg agctgggatg ctcggtcggc actgtccatc acgccctggc aggaatcgag 240 
205 cgatga 246 

208 <210> SEQ ID NO: 7 

209 <211> LENGTH: 78 

210 <212> TYPE: DNA 

211 <213> ORGANISM: Rhodococcus sp. 

213 <400> SEQUENCE: 7 

214 ctgactcgct accttgctga ctcgctacct tgctgactcg ctaccttgct gactcgctac 60 

215 cttgctgact cgctacct 78 

218 <210> SEQ ID NO: 8 

219 <211> LENGTH: 110 
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220 <212> TYPE: DNA 

221 <213> ORGANISM: Rhodococcus sp. 

223 <400> SEQUENCE: 8 

224 ccctcggtgt gttcaaagtt aatcgtgaag ccaagtcaga tgtttcggtg tcgccaggcg 60 

225 cccctcggtg tgttcaaagt taatcgtgaa gccaagtcag atgtttcggt 110 

228 <210> SEQ ID NO: 9 

229 <211> LENGTH: 27 

230 <212> TYPE: DNA 

231 <213> ORGANISM: Artificial Sequence 

233 <220> FEATURE: 

234 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

235 primer 

237 <400> SEQUENCE: 9 

238 ggaattcgct gtcatgagtg ggcaggt 27 

241 <210> SEQ ID NO: 10 

242 <211> LENGTH: 30 

243 <212> TYPE: DNA 

244 <213> ORGANISM: Artificial Sequence 

246 <220> FEATURE: 

247 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

248 primer 

250 <400> SEQUENCE: 10 

251 cctgcagaaa tgcccaggtt tcgttcgaga 30 

254 <210> SEQ ID NO: 11 

255 <211> LENGTH: 20 

256 <212> TYPE: DNA 

257 <213> ORGANISM: Artificial Sequence 

259 <220> FEATURE: 

260 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

261 primer 

263 <400> SEQUENCE: 11 

264 cacacctaaa ctgacatgct 2 0 

267 <210> SEQ ID NO: 12 

268 <211> LENGTH: 18 

269 <212> TYPE: DNA 

270 <213> ORGANISM: Artificial Sequence 

272 <220> FEATURE : 

273 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

274 primer 

276 <400> SEQUENCE: 12 

277 gcaatatggt cttcgcct 18 

280 <210> SEQ ID NO: 13 

281 <211> LENGTH: 20 
2 82 <212> TYPE: DNA 

283 <213> ORGANISM: Artificial Sequence 
285 <220> FEATURE: 

2 86 <22 3> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

2 87 primer 

289 <400> SEQUENCE: 13 
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